Semi-automatic Discovery of Mappings Between Heterogeneous Data Warehouse Dimensions
Authors
Abstract
Data Warehousing is the main Business Intelligence instrument for the analysis of large amounts of data. It permits the extraction of relevant information for decision-making processes inside organizations. Given the wide diffusion of Data Warehouses, there is an increasing need to integrate information coming from independent Data Warehouses or from independently developed data marts in the same Data Warehouse. In this paper, we provide a method for the semi-automatic discovery of common topological properties of dimensions that can be used to automatically map elements of different dimensions in heterogeneous Data Warehouses. The method uses techniques from the Data Integration research area and combines topological properties of dimensions in a multidimensional model.
Index Terms: Data Warehouse, P2P OLAP, dimension integration
Related resources
On the Use of Dimension Properties in Heterogeneous Data Warehouse Integration
A new trend in Business Intelligence is the process of combining information from two or more different and heterogeneous Data Warehouses. Existing solutions rely mostly on the Extract-Transform-Load (ETL) approach, a costly and laborious process. The process of Data Warehouse integration can be greatly simplified by developing methods to semi-automatically discover semantic mappings among attr...
A Semi Automatic Tool For Schema Mapping
...generic mapping framework at the schema level to address the problem of schema interoperability ... Providing a formalism for developing a generic, extensible, and semi-automated mapping ... A semi-automatic tool for schema mapping. ... at the University of Washington in Seattle, where he founded the database group. ... on Clio, the first semi-automatic tool for heterogeneous schema mapping. Keywords: data integ...
An a Priori Approach for Automatic Integration of Heterogeneous and Autonomous Databases
Data integration is the process that gives users access to multiple data sources through queries against a global schema. Semantic heterogeneity has been identified as the most important and toughest problem when integrating various data sources. Several approaches were proposed to deal with this problem. These approaches can be classified using three criteria: (1) data representation which mean...
Ontology-Based Conceptual Design of ETL Processes for Both Structured and Semi-Structured Data
One of the main tasks in the early stages of a Data Warehouse project is the identification of the appropriate transformations and the specification of inter-schema mappings from the data sources to the Data Warehouse. In this paper, we propose an ontology-based approach to facilitate the conceptual design of the back stage of a Data Warehouse. A graph-based representation is used as a conceptu...
Lightweight information integration through partial mapping and query reformulation
The growing amount of structured information becoming available, fostered by the advent and development of e.g. the Semantic Web and the Web 2.0 approaches, raises the need for (semi-)automatic, flexible and adaptable integration solutions. The effort invested into this partially manually created content can be leveraged by re-use and integration, so that additional communities of users can tak...
Journal title:
Volume, issue:
Pages: -
Publication date: 2011